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Identification of a new human coronavirus 


Lia van der Hoek 1 , Krzysztof Pyre 1 , Maarten F Jebbink 1 , Wilma Vermeulen-Oost 2 , Ron J M Berkhout 2 , 

Katja C Wolthers 1 , Pauline M E Wertheim-van Dillen 3 , Jos Kaandorp 4 , Joke Spaargaren 2 & Ben Berkhout 1 

Three human coronaviruses are known to exist: human coronavirus 229E (HCoV-229E), HCoV-OC43 and severe acute respiratory 
syndrome (SARS)-associated coronavirus (SARS-CoV). Here we report the identification of a fourth human coronavirus, HCoV- 
NL63, using a new method of virus discovery. The virus was isolated from a 7-month-old child suffering from bronchiolitis and 
conjunctivitis. The complete genome sequence indicates that this virus is not a recombinant, but rather a new group 1 
coronavirus. The in vitro host cell range of HCoV-NL63 is notable because it replicates on tertiary monkey kidney cells and the 
monkey kidney LLC-MK2 cell line. The viral genome contains distinctive features, including a unique N-terminal fragment within 
the spike protein. Screening of clinical specimens from individuals suffering from respiratory illness identified seven additional 
HCoV-NL63-infected individuals, indicating that the virus was widely spread within the human population. 


To date, there is still a variety of human diseases with unknown eti¬ 
ology. A viral origin has been suggested for many of these diseases, 
emphasizing the importance of a continuous search for new 
viruses 1-3 . Major difficulties are encountered, however, when 
searching for new viruses. First, some viruses do not replicate 
in vitro , at least not in the cells that are commonly used in viral diag¬ 
nostics. Second, for those viruses that do replicate in vitro and cause 
a cytopathic effect (CPE), the subsequent virus identification meth¬ 
ods may fail. Antibodies raised against known viruses may not rec¬ 
ognize the cultured virus, and virus-specific PCR methods may not 
amplify the new viral genome. To solve both problems, we devel¬ 
oped a new method for virus discovery based on the cDNA- 
amplified restriction fragment-length polymorphism technique 
(cDNA-AFLP 4 ). Here we report the identification of a new 
coronavirus using this method of Virus-Discovery-cDNA- 
AFLP (VIDISCA). 

Coronaviruses, a genus of the Coronaviridae family, are enveloped 
viruses with a large plus-strand RNA genome. The genomic RNA is 
27-32 kb in size, capped and polyadenylated. Three serologically dis¬ 
tinct groups of coronaviruses have been described. Within each 
group, viruses are characterized by their host range and genome 
sequence. Coronaviruses have been identified in mice, rats, chickens, 
turkeys, swine, dogs, cats, rabbits, horses, cattle and humans, and can 
cause a variety of severe diseases including gastroenteritis and respi¬ 
ratory tract diseases 5,6 . Three human coronaviruses have been stud¬ 
ied in detail. HCoV-229E and HCoV-OC43 were identified in the 
mid-1960s, and are known to cause the common cold 7-15 . The 
recently identified SARS-CoV causes a life-threatening pneumonia, 
and is the most pathogenic human coronavirus identified thus 
f ar i6-i8 SARS-CoV is likely to reside in an animal reservoir, and has 
recently initiated the epidemic in humans through zoonotic 


transmission 19,20 . It has been suggested that SARS-CoV is the first 
member of a fourth group of coronaviruses, or that it is an outlier of 
group 2 (refs. 21, 22). 

The new coronavirus that we present here was isolated from a child 
suffering from bronchiolitis and conjunctivitis. This was not an 
isolated case, as we identified the virus in clinical specimens from 
seven additional individuals, both infants and adults, during the last 
winter season. We also resolved the complete sequence of the viral 
genome, which revealed several unique features. 

RESULTS 

Virus isolation from a child with acute respiratory disease 

In January 2003, a 7-month-old child was admitted to the hospital 
with coryza, conjunctivitis and fever. Chest radiography revealed typi¬ 
cal features of bronchiolitis. A nasopharyngeal aspirate specimen was 
collected 5 d after the onset of disease (sample NL63). Diagnostic tests 
for respiratory syncytial virus, adenovirus, influenza viruses A and B, 
parainfluenza virus types 1, 2 and 3, rhinovirus, enterovirus, 
HCoV-229E and HCoV-OC43 yielded negative results. The clinical 
sample was subsequently inoculated onto human fetal lung fibro¬ 
blasts, tertiary monkey kidney cells (Cynomolgus monkey) and HeFa 
cells. CPE was detected exclusively on tertiary monkey kidney cells, 
and was first noted 8 d after inoculation. The CPE was diffuse, with a 
refractive appearance in the affected cells followed by cell detachment. 
More pronounced CPE was observed upon passage onto the monkey 
kidney cell line LLC-MK2, with overall cell rounding and moderate 
cell enlargement (Supplementary Fig. 1 online). Additional subcul¬ 
tures on human fetal lung fibroblasts, rhabdomyosarcoma cells and 
Vero cells remained negative for CPE. Immunofluorescence assays to 
detect respiratory syncytial virus, adenovirus, influenza viruses A and 
B, and parainfluenza virus types 1, 2 and 3 remained negative. Acid 
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Figure 1 The VIDISCA method, (a) Schematic overview of steps in VIDISCA 
method, (b) Examples of VIDISCA-mediated virus identification. Specimens 
were analyzed using ethidium bromide-stained agarose (parvovirus B19) or 
Metaphor agarose (HBV and HIV-1) gel electrophoresis. Lane M, DNA 
molecular weight markers; negative controls; +, VIDISCA PCR products 
for HBV (amplified with primers /-///7P1I-T/Msel-T), parvovirus B19 (HinPl\ 
standard primer only) or HIV-1 (EcoRI-A/Msel-C primers), (c) VIDISCA PCR 
products for HCoV-NL63. HinPl\-G and Mse l-A primers were used for 
selective amplification; products were visualized by Metaphor agarose gel 
electrophoresis. Lanes 1 and 2, duplicate PCR product of cultured HCoV- 
NL63 harvested from LLC-MK2 cells; 3 and 4, duplicate control 
supernatant from uninfected LLC-MK2 cells; 5 and 6, duplicate negative 
controls containing water; M, 25-bp molecular weight marker. Arrow 
indicates HCoV-NL63 fragment that was excised from gel and sequenced. 


lability and chloroform sensitivity tests indicated that the virus was 
most likely enveloped, and did not belong to the picornavirus group 23 . 

Virus discovery by the VIDISCA method 

Identification of unknown pathogens using molecular biology tools is 
difficult because the target sequence is not known, so genome-specific 
PCR primers cannot be designed. To overcome this problem, we 
developed the VIDISCA method based on the cDNA-AFLP tech¬ 
nique 4 . The advantage of VIDISCA is that prior knowledge of the 
sequence is not required, as the presence of restriction enzyme sites is 
sufficient to guarantee PCR amplification. The input sample can be 
either blood plasma or serum, or culture supernatant. Whereas 
cDNA-AFLP starts with isolated mRNA, VIDISCA begins with a 
treatment to selectively enrich for viral nucleic acid, including a cen¬ 
trifugation step to remove residual cells and mitochondria (Fig. la). A 
DNase treatment is also used to remove interfering chromosomal and 
mitochondrial DNA from degraded cells (viral nucleic acid is pro¬ 
tected within the viral particle). Finally, by choosing frequently cut¬ 
ting restriction enzymes, the method can be fine-tuned such that 
most viruses will be amplified. We were able to amplify viral nucleic 
acids in EDTA-treated plasma from a person with hepatitis B viral 
infection, and from a person with an acute parvovirus B19 infection 
(Fig. lb). The technique can also detect HIV-1 in cell culture, demon¬ 
strating its capacity to identify both RNA and DNA viruses (Fig. lb). 

The supernatant of the CPE-positive LLC-MK2 culture NL63 was 
analyzed by VIDISCA. The supernatant of uninfected cells was used 
as a negative control. After the second PCR amplification step, unique 
and prominent DNA fragments were present in the test sample but 
not in the control (1 of 16 selective PCR reactions is shown in Fig. lc). 
These fragments were cloned and sequenced. Thirteen of 16 frag¬ 
ments showed sequence similarity to members of the coronavirus 
family, but significant sequence divergence with known coronaviruses 
was apparent in all fragments, indicating that we had identified a new 
coronavirus. The sequences of the 13 VIDISCA fragments are pro¬ 
vided in Supplementary Figure 2 online. 


Detection of HCoV-NL63 in patient specimens 

To show that HCoV-NL63 originated from the nasopharyngeal 
aspirate of the child, we designed a diagnostic RT-PCR that specifi¬ 
cally detects HCoV-NL63. This test confirmed the presence of 
HCoV-NL63 in the clinical sample. The sequence of the RT-PCR 
product of the lb gene was identical to that of the virus identified 
upon in vitro passage in LLC-MK2 cells (data not shown). 

Having confirmed that the cultured coronavirus originated from 
the child, the question remained as to whether this was an isolated 
clinical case, or whether HCoV-NL63 is circulating in humans. To 
address this question, we used two diagnostic RT-PCR assays to exam¬ 
ine respiratory specimens of hospitalized individuals and those 
visiting the outpatient clinic between December 2002 and August 
2003 (Fig. 2). We identified seven additional individuals carrying 



(0 

O- Dec Jan Feb Mar Apr May Jun Jul Aug 

02 03 03 03 03 03 03 03 03 



Dec Jan Feb Mar Apr May Jun Jul Aug 

02 03 03 03 03 03 03 03 03 


Figure 2 Detection of HCoV-NL63 in winter months of 2002 and 2003. 
(a) Number of patients tested per month, (b) Percentage of patients 
positive for HCoV-NL63. 
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HCoV-NL63 (Table 1). Sequence analysis of the PCR products indi¬ 
cated the presence of a few characteristic point mutations in several 
samples, suggesting that several viruses with different molecular 
markers may be cocirculating (Fig. 3 and Supplementary Fig. 3 
online). At least five of the HCoV-NL63-positive individuals suffered 
from respiratory tract illness; the clinical data of two individuals was 
not available. Including the index case, five of the patients were chil¬ 
dren less than 1 year old, and three patients were adults. Two adults 
were likely to be immunosuppressed, as one of them was a bone mar¬ 
row transplant recipient and the other an HIV-positive patient suffer¬ 
ing from AIDS, with very low CD4 + cell counts (Table 1). No clinical 
data was available for the third adult. One patient was coinfected with 
respiratory syncytial virus (no. 72), and the HIV-infected patient 
(no. 466) carried Pneumocystis carinii. No other respiratory agent was 
found in the other patients, suggesting that the respiratory symptoms 
were caused by HCoV-NL63. All positive samples were collected dur¬ 
ing the last winter season, with a detection frequency of 7% in January 
2003. None of the 306 samples collected in the spring and summer of 
2003 contained HCoV-NL63 (P < 0.01 by two-tailed f test). 

Complete genome analysis of HCoV-NL63 

The genomes of coronaviruses have a characteristic organization. The 
5' two-thirds contain the la and lb genes that encode the nonstruc- 
tural polyproteins, followed by the genes encoding four structural 
proteins: spike (S), envelope (E), membrane (M) and nucleocapsid 
(N). The genomes of known coronaviruses contain a variable number 
of unique characteristic open reading frames (ORFs) encoding non- 
structural proteins either between the lb and S genes, between the 
S and E genes, between the M and N genes, or downstream of the 
N gene. 

To determine whether the HCoV-NL63 genome organization 
shares these characteristics, we constructed a cDNA library with puri¬ 
fied virus stock as input material. A total of 475 genome fragments 
were analyzed, with an average coverage of seven sequences per 
nucleotide. Specific PCR reactions were designed to fill in gaps and to 
sequence regions with low-quality sequence data. We combined this 
with 5' and 3' rapid amplification of cDNA ends to resolve the com¬ 
plete HCoV-NL63 genome sequence. 

The RNA genome of HCoV-NL63 consists of 27,553 nucleotides 
and a poly-A tail. With a GC content of 34%, HCoV-NL63 has the 
lowest GC content among the Coronaviridae , which range from 
37-42% (ref. 24). ZCurve software was used to identify the ORFs 25 , 
and the genome configuration was portrayed using the similarity 
with known coronaviruses as a guide (Fig. 4a and Supplementary 
Table 1 online). Short untranslated regions (UTRs) of 286 and 287 
nucleotides are present at the 5' and 3' termini, respectively. The la 
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Figure 3 Phylogenetic analysis of RT-PCR sequences of the la gene from 
HCoV-NL63-positive patients. HCoV-229E was used to root the tree. 


Table 1 Patients positive for HCoV-NL63 


Patient no. 

Age 

Symptoms 

Specimen 

Sample date 

72 

4 months 

URTI 

NPA 

31 Dec 2002 

251 

67 years 

Unknown 

OPA 

7 Jan 2003 

223 

52 years 

Unknown 13 

OPA 

8 Jan 2003 

NL63 a 

7 months 

LRTI 

NPA 

10 Jan 2003 

246 

9 months 

LRTI 

OPA 

13 Jan 2003 

248 

11 months 

URTI 

OPA 

16 Jan 2003 

466 

40 years 

LRTI c 

BAL 

4 Feb 2003 

496 

7 months 

URTI 

OPA 

25 Feb 2003 


a Seven-month-old patient from whom HCoV-NL63 was cultured. b Patient received bone mar¬ 
row transplant in 2001. c Patient infected with HIV-1; 20 CD4 + cells per mm 3 . URTI, upper 
respiratory tract illness; LRTI, lower respiratory tract illness; NPA, nasopharyngeal aspirate; 
OPA, oropharyngeal aspirate; BAL, bronchoalveolar lavage. 


and lb genes encode the RNA polymerase and proteases that are 
essential for virus replication. A potential pseudoknot structure is 
present at position 12,439 (data not shown), which may provide the 
-1 frameshift signal to translate the lb polyprotein. Genes predicted 
to encode the S, E, M and N proteins are found in the 3' part of the 
genome. The hemagglutinin-esterase gene, which is present in some 
group 2 coronaviruses, is not present in HCoV-NL63. ORF3, located 
between the S and E genes, probably encodes a single accessory non- 
structural protein; this gene showed only limited similarity to ORF4A 
and ORF4B of HCoV-229E and ORF3 of porcine epidemic diarrhea 
virus (PEDV). 

The la and lab polyproteins are translated from the genomic RNA, 
but the remaining viral proteins are translated from subgenomic 
mRNAs made by discontinuous transcription during negative strand 
synthesis 26 . Each subgenomic mRNA has a common 3' end, derived 
from the 5' portion of the genome (the 5' leader sequence), and com¬ 
mon 3' coterminal parts. Discontinuous transcription requires base¬ 
pairing between as-acting transcription regulatory sequences (TRSs), 
one located near the 3' part of the viral genome (the leader TRS) and 
others located upstream of each of the respective ORFs (the body 
TRSs) 27 . The cDNA bank that we sequenced contained copies of the 
subgenomic mRNA for the N protein, thus providing the opportunity 
to exactly map the leader sequence that is fused to all subgenomic 
mRNAs. A leader of 72 nucleotides was identified at the 5' UTR. 
Eleven of twelve nucleotides of the leader TRS (S'-UCU- 
C AACUA A AC -3') showed similarity with the body TRS upstream of 
the N gene. Putative TRSs were also identified upstream of the S, 
ORF3, E and M genes (Supplementary Table 2 online). 

We next aligned the sequence of HCoV-NL63 with the complete 
genomes of other coronaviruses. The percentage nucleotide identity 
was determined for each gene and is listed in Table 2. All genes except 
the M gene shared the highest identity with HCoV-229E. To confirm 
that HCoV-NL63 is a new member of the group 1 coronaviruses, we 
conducted phylogenetic analysis using the nucleotide sequence of 
the la, lb, S, M and N genes (Fig. 4b). For each gene analyzed, 
HCoV-NL63 clustered with the group 1 coronaviruses. The la, lb 
and S genes of HCoV-NL63 are most closely related to those of 
HCoV-229E. However, further inspection revealed a subcluster of 
HCoV-NL63, HCoV-229E and PEDV. Phylogenetic analysis could 
not be performed for the ORF3 and E genes because the regions were 
too variable or too small for analysis, respectively. Bootscan analysis 
by the Simplot software version 2.5 (ref. 28) found no signs of 
recombination (data not shown). 

The presence of a single nonstructural gene between the S and E 
genes is noteworthy because almost all coronaviruses have two 
or more ORFs in this region, with the exception of PEDV and 
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Figure 4 HCoV-NL63 genome organization and phylogenetic analysis. 

(a) ORFs encoding la, lb, S, 0RF3, E, M and N proteins are flanked by 
286-nucleotide 5' UTR and 287-nucleotide 3' UTR. Coordinates of each 
ORF are provided in Supplementary Table 1 online, (b) Phylogenetic 
analysis of FICoV-NL63, using nucleotide sequences predicted to encode 
la, lb, S, M and N proteins (see Supplementary Methods online for 
GenBank accession numbers). Red, group 1 viruses; blue, group 2; 
green, group 3; purple, SARS-CoV. MHV, mouse hepatitis virus; IBV, avian 
infectious bronchitis virus; BCoV, bovine coronavirus; FCoV, feline enteric 
coronavirus; CCoV, canine coronavirus; FIPV, feline infectious peritonitis 
virus; EqCoV, equine coronavirus; TCoV, turkey coronavirus. 


HCoV-OC43 (ref. 29,30). Perhaps most notable is a large insert of 537 
nucleotides in the 5' portion of the S gene of HcoV-NL63, as com¬ 
pared with that of HCoV-229E. A BLAST search found no similarity 
between this additional 179-amino acid domain of the S protein and 
any coronavirus or other sequence deposited in GenBank. An align¬ 
ment of the HCoV-NL63 S protein sequence with those of other 
group 1 coronaviruses is shown in Supplementary Figure 4 online. 

DISCUSSION 

In this study we present a detailed description of a new human coron¬ 
avirus. Thus far, only three human coronaviruses have been charac¬ 
terized if we include SARS-CoV; further characterization of 
HCoV-NL63 as the fourth member will provide important insight 
into the variation among human coronaviruses. HCoV-NL63 is a 
member of the group 1 coronaviruses and is most closely related to 
HCoV-229E, but the differences between them are prominent. First, 
they share on average only 65% sequence identity. Second, a single 
gene, ORF3, in HCoV-NL63 takes the place of the 4A and 4B genes of 
HCoV-229E. Third, the 5' region of the S gene of HCoV-NL63 con¬ 
tains a large in-frame insertion of 537 nucleotides. The N-terminal 
region of the S protein has been implicated in binding to aminopepti- 
dase N (group I coronaviruses) and sialic acid 31-33 , so the 179-amino 
acid insert in HCoV-NL63 might be involved in receptor binding and 
may explain the tropism of this virus in cell culture. However, the 
aminopeptidase N receptor-binding domain of the HCoV-229E S 
protein has been mapped to amino acids 407-547 (ref. 33), so it seems 
unlikely that the insertion will be directly involved in binding to 
aminopeptidase N. Fourth, whereas HCoV-229E is fastidious in cell 
culture with a narrow host range, HCoV-NL63 replicates efficiently in 
monkey kidney cells. SARS-CoV is also able to replicate in monkey 
kidney cells (Vero-E6 cells 34 ), yet the predicted S proteins of SARS- 
CoV and HCoV-NL63 do not share a domain that could explain the 
in vitro host cell range of these viruses. Other viral proteins may influ¬ 
ence the cell tropism of a virus, but none of the HCoV-NL63 proteins 
were more closely related to SARS-CoV than to HCoV-229E. 

Variability at the 5' end of the S gene, correlating with alterations in 
tropism, has also been described for the group 1 coronaviruses 
porcine respiratory coronavirus (PRCoV) and transmissible gas¬ 
troenteritis virus (TGEV). These porcine viruses are antigenically and 
genetically related, but their pathogenicity is markedly different. 
TGEV replicates in and destroys the enterocytes of the small intestine, 
causing severe diarrhea with high mortality in neonatal swine. In con¬ 
trast, PRCoV (which emerged more recently than TGEV) has a selec¬ 
tive tropism for respiratory tissue, and very little capacity to replicate 
in intestinal tissue. The difference between the TGEV and PRCoV S 
gene sequences is comparable to the difference between those of 
HCoV-NL63 and HCoV-229E 35 . Compared with TGEV, PRCoV con¬ 
tains a deletion in the 5' hypervariable region of the S gene. The extra 
region that is present at the 5' end of the TGEV S gene is responsible 
for the hemagglutination activity of TGEV, and its capacity to bind to 
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Table 2 Percent nucleotide sequence identity between HCoV-NL63 and other 
coronaviruses 



Group 1 



Group 2 


Group 3 

Group 2/4 

Gene 

HCoV-229E 

PEDV 

TGEV 

BCoV 

MHV 

IBV 

SARS-CoV 

la 

63 

57 

50 

40 

38 

37 

38 

lb 

75 

72 

69 

58 

57 

59 

58 

S 

56 

53 

48 

35 

34 

32 

32 

ORF3 

53 

49 

29 

29 

25 

19 

23 

E 

59 

54 

49 

37 

39 

25 

40 

M 

64 

65 

46 

47 

43 

43 

42 

N 

52 

38 

45 

30 

29 

31 

30 


GenBank accession numbers for viruses are provided in Supplementary Methods online. 

sialic acid 32 . However, this region shows no similarity to the 
HCoV-NL63 insert. 

The common cold-causing virus HCoV-229E can cause more seri¬ 
ous respiratory disease in infants and immunocompromised 
patients 36,37 . Our data indicate that HCoV-NL63 causes acute respi¬ 
ratory disease in children below the age of 1 year, and in immuno¬ 
compromised adults. To date, no known viral pathogen can be 
identified in a substantial portion of respiratory disease cases in 
humans (20-30%; ref. 38). Several assays have been used to diagnose 
coronavirus infections. Traditionally, an antibody test is imple¬ 
mented to measure a rise in titers of antibodies to the human coron¬ 
aviruses HCoV-229E or HCoV-OC43 (ref. 12). Antibodies to 
HCoV-NL63 might cross-react with HCoV-229E, given that these 
viruses are members of the same serotype. If this were the case, 
HCoV-NL63 infections might have been misdiagnosed as 
HCoV-229E. Molecular biology tools such as RT-PCR assays 39,40 
were designed to selectively detect the human coronaviruses 
HCoV-229E and HCoV-OC43, but these assays will not detect 
HCoV-NL63. Even the RT-PCR assay that was designed to amplify all 
known coronaviruses 40 is not able to amplify HCoV-NL63 because 
of several mismatches with the primer sequences. The availability of 
the complete HCoV-NL63 genome sequence means that these 
diagnostic assays can be substantially improved. 

Our results indicate that HCoV-NL63 is present in a significant 
number of respiratory tract illnesses of unknown etiology. 
HCoV-NL63 was detected in patients suffering from respiratory 
disease, with a frequency of up to 7% in January 2003. The virus was 
not detected in more recent samples collected in the spring and 
summer of 2003, which correlates with the fact that human coron¬ 
aviruses tend to be transmitted predominantly in the winter season 12 . 
Future experiments with more sensitive diagnostic tools should yield 
a more accurate picture of the prevalence of this virus and its 
association with respiratory disease. 

METHODS 

VIDISCA method. The virus was cultured on LLC-MK2 cells. Details of virus 
culture and patient descriptions are available in Supplementary Methods 
online. To remove residual cells and mitochondria, 110 pi of virus culture 
supernatant was spun for 10 min at maximum speed (13,500 r.p.m.) in an 
Eppendorf microcentrifuge. To remove chromosomal DNA and mitochondr¬ 
ial DNA from the lysed cells, 100 pi of supernatant was transferred to a fresh 
tube and treated with DNase I for 45 min at 37 °C (Ambion). Nucleic acids 
were extracted as described 41 . A reverse transcription reaction was performed 
with random hexamer primers (Amersham Bioscience) and Moloney murine 
leukemia virus reverse transcriptase (MMLV-RT; Invitrogen). Second-strand 
DNA synthesis was carried out with Sequenase II (Amersham Bioscience), 
without further addition of a primer. A phenol-chloroform extraction was 
followed by ethanol precipitation. 


cDNA-AFLP was performed essentially as 
described 4 , with some modifications. The double- 
stranded DNA was digested with the HmPlI and 
Msel restriction enzymes (New England Biolabs). 
Msel and HmPlI anchors (see below) were subse¬ 
quently added, along with 5U ligase enzyme 
(Invitrogen) in the supplied ligase buffer, for 2 h at 
37 °C. The Msel and HmP II anchors were prepared 
by mixing a top-strand oligonucleotide (S'-CTCG- 
TAGACTGCGTACC-3' for the Msel anchor and 
5 , -GACGATGAGTCCTGAC-3 / for the HmPlI 
anchor) with a bottom-strand oligonucleotide 
(5 , -TAGGTACGCAGTC-3 / for the Msel anchor 
and 5'- CGGTCAGGACTCAT-3 / for the HmPlI 
anchor) in a 1:40 dilution of ligase buffer. Twenty 
cycles of PCR were carried out with 10 pi of the ligation mixture, 100 ng of 
HmPlI standard primer (S'-GACGATGAGTCCTGACCGCW) and 100 ng of 
Msel standard primer (5 / -CTCGTAGACTGCGTACCTAA-3 / ). Five microliters 
of this PCR product was used as input in the second ‘selective’ amplification 
step, along with 100 ng HmPlI N-primer and 100 ng Msel N-primer (the ‘N’ 
indicates that the standard primers were extended with one nucleotide; G, A, T 
or C). The selective rounds of amplification were done using ‘touchdown 
PCR’: 10 cycles of 94 °C for 60 s, 65 °C for 30 s, and 72 °C for 1 min (annealing 
temperature reduced by 1 °C per cycle); 23 cycles of 94 °C for 30 s, 56 °C for 
30 s, and 72 °C for 1 min; and finally 1 cycle of 72 °C for 10 min. Sixteen PCR 
reactions, each with 1 of the 16 primer combinations, were conducted for each 
sample in this selective PCR. The PCR products were analyzed on 4% 
Metaphor agarose gels (Cambrex), and the fragments of interest were cloned 
and sequenced using BigDye terminator reagents. Electrophoresis and data 
collection were performed using an ABI 377 instrument. DNA molecular 
weight markers were from Invitrogen and Eurogentec. 

To detect HIV-1, we used VIDISCA with EcoRl digestion instead of HmPlI 
digestion. VIDISCA was modified for parvovirus B19 detection as follows: the 
reverse transcription step was excluded; only HmPlI digestion and adaptor 
ligation were performed; the first PCR reaction was performed with 35 cycles 
instead of 20; and the first PCR fragments were visualized by agarose gel elec¬ 
trophoresis. Details of cDNA library construction and full genome sequencing 
are available in Supplementary Methods online. 

Diagnostic RT-PCR. A total of 614 respiratory samples were collected from 
493 individuals between December 2002 and August 2003 at the Academic 
Medical Center in Amsterdam. The specimens included oral and nasopharyn¬ 
geal aspirates, throat swabs, bronchoalveolar lavage and sputum. The samples 
had been collected for routine viral diagnostic screening of people suffering 
from upper and/or lower respiratory tract diseases, and the patients consented 
that their samples be used for testing of respiratory viruses that included coro¬ 
naviruses. We used 100 pi of each sample in a Boom extraction 41 . The diagnos¬ 
tic assay was designed based on the sequence of the lb gene. The reverse 
transcription was performed with MMLV-RT (Invitrogen), using 10 ng of 
reverse transcription primer (repSZ-RT, 5 / -CCACTATAAC-3 / ; coordinate 
16232 in HCoV-NL63). The entire reverse transcription mixture was added to 
the first PCR mixture containing 100 ng of primer repSZ-1 (S'-GTGATG- 
CATATGCTAATTTG-3 / ; coordinate 15973) and 100 ng of primer repSZ-3 
(5 / -CTCTTGCAGGTATAATCCTA-3 / ; coordinate 16210). The PCR reaction 
consisted of the following steps: 95 °C for 5 min; then 35 cycles of 95 °C for 
1 min, 55 °C for 1 min, and 72 °C for 2 min; then 72 °C for 10 min. 

A nested PCR was started using 5 pi of the first PCR product with 100 ng of 
primer repSZ-2 (5LTTGGTAAACAAAAGATAACT-3'; coordinate 16012) and 
100 ng of primer repSZ-4 (5 / -TCAATGCTATAAACAGTCAT-3 / ; coordinate 
16181). Twenty-five PCR cycles were performed using the same profile as the 
first PCR. Ten microliters of each PCR product was analyzed by agarose gel 
electrophoresis. All positive samples were repeated and sequenced to confirm 
the presence of HCoV-NL63. To verify negative and positive PCR results, an 
additional diagnostic RT-PCR assay was conducted using the la gene primers 
5 / -AATATGTCTAACAAATAAAACGATT-3 / (reverse transcriptase primer 
P4H10-3; coordinate 6667), 5 , -CTTTTGATAACGGTCACTATG-3 / (SS 5852- 
5P; coordinate 5777) and 5 / -CTCATTACATAAAACATCAAACGG-3 / 
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(P4G1M-5-3P; coordinate 6616) in the first PCR; and 5 r -GGTCACTATG- 
TAGTTTATGATG-3' (P3E2-5P; coordinate 5788) and 5 r - 
GGATTTTTCATAACCACTTAC-3 / (SS 6375-3P; coordinate 6313) in the 
nested PCR. Details of sequence analysis are available in Supplementary 
Methods online. 

GenBank accession numbers. The HCoV-NL63 sequences were deposited in 
GenBank under accession numbers AY567487-AY567494. 

Note: Supplementary information is available on the Nature Medicine website. 
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